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DescHption 

Background of the invention . i. , ^ *u 

This invention relates to the application of recombinant DNA technology 1o the production of 
5 polypeptide in vertibrate cell cultures. More specifically, this invention relates to utilizing the coding, 
sequence for a secondary control polypeptide as a tool in controlling production of a foreign polypeptide 

by the vertebrate cell culture. , ^ , * : • 

The general principle of utilizing a host cell for the production of a heterologous protein— i.e., a protein 
which is ordinarily not produced by this celMs well known. However, the technical difficulties of obtaining 

10 reasonable quantities of the heterologous protein by employing vertebrate host cells which are desirable 
by virtue of their properties with regard to handling the protein formed are many. There have been a 
number of successful examples of incorporating genetic material coding for heterologous proteins into 
bacteria and obtaining expression thereof. For example, human Interferon, desacetyUthymosin alpha-!, 
somatostatin, and human growth hormone have been thus produced. Rf<^"t*y' ^ I^Vf c.^o 

IS utilise non-bacterial hosts such as yeast cells (sec, e-g., co-pending application, U.S. Serial No. 237,913, 
filed February 25, 1981; EPO Publication No, 0060057) and vertebrate cell cultures (U,S. Application Serial 
No 298 235 filed August 31, 1981; EPO Publication No. 0073656) as hosts. The use of vertebrate cell 
cuitures'as hosts In the production of mammalian proteins is advantageous because such systeriis have 
additional capabilities for modification, glycosylation, addition of transport sequences, and other 

20 subsequent treatment of the resulting peptide produced In the cell. For example, while bacteria may be 
successfully transfected and caused to express "alpha tiiymosin", the polypeptide produced lacks the 
N-acetyl group of the "natural" alpha thymosin found in mammalian system. 

In general, the genetic engineering techniques designed to enable host cells to produce heterologous 
proteins include preparation of an "expression vector" which is a DNA sequence containing, 

25 (1 ) a "promoter", i.e., a sequence of nucleotides controlling and permitting the expression of a coding 

sequence; ^. „ . 

(2) a sequence providing mRNA with a ribosome binding site; . , , , . 

(3) a "coding region" i.e., a sequence of nucleotides which codes for the desired polypeptide; and 

(4) a "termination sequence" which permHs transcription to be terminated when the entire code for the 

30 desired protein has been read; and „ „ „ - * ^Mr^u 

(5) if the vector is not directly Inserted into the genome, a "replicon or ongin of replication which 
permits the entire vector to be reproduced once it is within the cell. 

In tiie construction of vectors for the method of the present invention, the same promoter controls two 
coding sequences, one for a desired protein, and the other for a secondary protein. Transcription 
35 termination is also shared by these sequences. However, the proteins are produced in discrete form 
because they are separated by a stop and start translational signal. 

Ordinarily, the genetic expression vectors are in the form of plasmids, which are extrachromosomal 
loops of double stranded DNA. These are found in natural form in baceteria, often in multiple copies per 
cell However, artificial plasmids can also be constructed, (and these, of course, are the most useful), by 
splicing together the four essential elements outlined above in proper sequence using appropriate 
"restriction enzymes". Restriction enzymes are nucleases whose catalytic activity is limited to lysing at a 
particular base sequence, each base sequence being characteristic for a particular restriction enzyme. By 
artful construction of the terminal ends of tiie elements outiined above (or fractions thereof) restriction 
enzymes may be found to splice these elements together to form a finished genetic expression vector. 

It then remains to induce the host cell to incorporate tiie vector (transfection), and to grow the host 
cells In such a way as to effect the syntiiesis of the polypeptide desired as a concomitant of normal growth. 

Two typical problems are associated with the above-outiined procedure. First, it is desirable to have In 
tiie vector, in addition to tiie four essential elements outiined above, a maricer which will permit a 
straightfonward selection for those cells which have. In fact accepted the genetic expression vector. In 
using bacterial cells as hosts, frequently used maricers are resistance to an antibiotic such as tetracycline or 
ampicillin. Only those cells which are drug resistant will grow in cultures containing the antibiotic. 
Therefore, If the cell culture which has been sought to be transfected Is grown on a medium containing the 
antibiotic, only the cells actually transfected will appear as colonies. As the frequency of transformation is 
quite low (approximately 1 cell in 10® being transfected under Ideal conditions), this is almost an essential 

S5 prerequisite as a practical matter. , . * ^ i. « -.n3\ 

For vertebrate cells as hosts, the transformation rate achieved Is more efficient (about 1 cell In 10 ). 
However, facile selection remains Important In obtaining the desired transfected cells. Selection is 
rendered Important, also, because the rate of cell division Is about fifty times lower than in baceterial 
ceils— i.e., although £ coif divide once In about every 20-50 minutes, human tissue culture cells divide 
60- only once in every 12 to 24 hours. 

The present invention, in one aspect, addresses the problem of selecting for vertibrate cells which have 
taken up the genetic expression vector for the desired protein by utilizing expression of tiie coding 
sequence for a secondary protein, such, for example, as an essential enzyme In which the host cell is 
deficient For example, dihydrofolate reductase (DHFR) may be used as a maricer using host cells deficient 
cs In DHFR, 
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A second problem attendant on production of polypeptides in a foregin host is recovery of satisfactory 
quantities of protein. It would be desirable to have some mechanism to regulate, and preferably enhance, 
the production of the desired heterologous polypeptide. In a second aspect of the {nvention, a secondary 
coding sequence which can be affected by externally controlled parameters is titllized to allow control of 

5 expression by control of these parameters. Furthermore, provision of both sequences on a polycistrom in 
itself permits selection of transformants with high expression levels of the primary sequence. 

It has been shown that DHFR coding sequences can be introduced into, expressed in, and amplified in 
mammalian cells. Genomic DNA from methotrexate resistant Chinese Hamster Ovary (CHO) cells has been 
introduced into mouse cells and results in transformants which are also resistant to methotrexate (1). The 

10 mechanism by which methotrexate (MTX) resistance in mouse cells is developed appears to be threefold: 
through gene amplification of the DHFR coding sequence (2, 3, 4); through decrease in uptake of MTX (5, 6) 
and through reduction In affinity of the DHFR produced for MTX (7). 

It appears that amplification of the DHFR gene through MTX exposure can result in a concommitant 
amplification of a cotransfected gene sequence. It has also been shown that mouse fibroblasts, transfected 

IS with both a plasmid containing hepatitis B DNA sequences, and genomic DNA from a hamster cell line 
containing a mutant gene for MTX-resistant DHFR, secrete Increased amounts of hepatitis B surface 
antigen (HBsAg) into the medium when MTX is employed to stimulate DHFR sequence amplification (8). 
Further, mRNA coding for the £ cofi protein XGPRT is amplified in the presence of MTX in CHO cells 
co-transfected with the DHFR and XGPRT gene sequences under control by independent promoters (9). 

20 Rnally, Increased expression of a sequence endogenous to the promoter In a DHFR/SV40 plasmid 
combination In the presence of MTX has been demonstrated (10). 

It is known that viruses which can infect vertebrate cells often have more than one coding sequence 
under the control of a single promoter. However, these coding sequences are not expressed 
simultaneously, but rather they selectively are brought under the control of the promoter for transcription 

^ by splicing of the mRNA, for which purpose specific splice sites (donor and acceptor regions) are 
recognisable In the sequence. Examples of this can be seen in Eukaryot/c Viral Vectors, ed. Yakov Gluzman, 
CSHU 1982, pp 145—151 and pp 193—198. 

However, there Is no effective intervening splice site, so that translation of the second (downstream) 
coding sequence would not be expected. 

30 It Is also known that in vertebrate cells sometimes translation is not initiated at the first AUG codon in 
mRNA, but rather from an AUG somewhat downstream, which may be internal to the open reading frame 
(see for example Subramain et al, MoL and Cell. BfoL, vol. 2, pp 854—864). 

The present Invention is based on the discovery that, in vertebrate cell hosts, where the genetic 
expression vector for a desired polypeptide contains a secondary genetic coding sequence under the 

3S control of the same promoter but downstream of and separated from the primar/ coding sequence by 
translation stop and start codons and without any effective Intervening splice site, nevertheless some 
transfectants may express both sequences, albeit the second more weakly than the first This secondary 
sequence can therefore provide for a convenient screening marker, both for transformants In general, and 
for transformants showing high expression levels for the primary sequence, as well as serving as a control 

40 device whereby the expression of a desired polypeptide can be regulated, most frequentiy enhanced. 
This is particularly significant as the two proteins, according to the method of this. Invention, are 
produced separately in mature form. While both DNA coding sequences are controlled by the same 
transcriptional promoter, so that a fused message (mRNA) Is formed, they are separated by a translational 
stop signal for the first and start signal for the second, so that two independent proteins result 

4Ef As a vertebrate host cell culture system Is often advantageous because it Is capable of glycosylation, 
phosphorylation, and lipid association appropriate to animal systems (whereas bacterial hosts are not), it is 
dgnlficant that marker systems and regulating systems can be provided within this context 

The present Invention concems a method of selecting transfected vertebrate host cells for expression 
of a desired polypeptide through the use of a polycistronic expression vector which contains sequences 

so coding for a secondary protein and a desired protein, wherein both the desired and secondary sequences 
are governed by the same promoter. The coding sequences are separated by translational stop and start 
signal codons. The expression of the secondary sequence effects control over the expression of the 
sequence for the desired protein, and the secondary protein functions as a marker for selection of 
transfected cells. The Invention Includes use of secondary sequences having either or both of these effects. 

^ Brief description of the drawings 

Rgure 1 shows the construction of an expression vector for HBsAg, pE342.HS94.HBV. 
Rgure 2 shows the construction of an expression vector for DHFR, pE342.D22. 
Rgure 3 shows the construction of expression vectors for DHFR and HBsAg, pE342.HBV.D22 and 
pE342.HBV.E400.D22. 

60 

Detailed description and description of the preferred embodiment 
A. Definitions 

As used herein 

"Plasmids" Includes both naturally occuring plasmlds In bacteria, and artificially constructed circular 
es DNA fragments. 
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"Expression vector" means a plasmid which contains at least the four essential elements set forth 
hereinabove for the expression of the heterologous peptide In a host cell culture. 

"Heterologous protein" means a protein or peptide which is not normally produced by, or required for 
the viability of, the host organism. i_ j r l • 

5 "Desired protein" means a heterologous protein or peptide which the method of the mvention is 

designed to produce. • ,j . ^ 

"Secondary peptide" means the protein or peptide which is not the heterologous peptide desired as 
the primary product of the expression in the host cell, but rather a different heterologous peptide which, by 
virtue of its own characteristics, or by virtue of the characteristics of the sequence coding for it is capable of 
to "marking" transfection by the expression vector and/or regulating the expression of the primanly desired 

heterologous peptide, , ^ , . . -j * u * -iaaa 

The peptide sequence may be either long or short ranging from about 6 ammo acids to about 1000 
amino acids. The conventional distinction between the words peptide and protein is not routinely observed 
in the description of the invention. If the distinction Is to be made, it will be so specified. 
IS "Primary sequence" is the nucleotide sequence coding for the desired peptide, and 

"Secondary sequence" means a sequence of nucleotides which codes for the secondary peptide. 
'Transfection" of a host cell means that an expression vector has been taken up by the host cell in a 
detectable manner whether or not any coding sequences are In fact expressed. In the context of the present 
invention, successful transfection will be recognized when any Indication of the operation of this vector 
20 wititin the host cell Is realized. It Is recognized tiiat there are various levels of success witiiin its context 
Rret, the vector's coding sequence may or may not be expressed. If the vector Is properly constructed with 
inclusion of promoter and terminator, however. It Is highly probable tiiat expression will occur. Second, if 
the plasmid representing tiie vector is taken up by the cell and expr^sed, but fails to be Incorporated 
within the normal chromasomal material of the cell, the ability to express tins plasmid will be lost after a 
25 few generations. On tiie other hand, if the vector is taken up within the chromosome, the expression 
remains stable through repeated replications of tiie host cell. There may also be an intermediate result The 
precise details of the manner in which transfection can thus occur are not understood, but It is clear that a 
continuum of outcomes Is found experimentally in terms of the stability of the expression over several 
generations of the host culture. 

B. A preferred embodiment of the desired peptide 

In a preferred specific embodiment exemplary of the invention herein, the pnmary genetic sequerice 
encodes the hepatitis B-surface antigen (HBsAg). This protein is derived from hepatitis B virus, the infective 
agent of hepatitis 6 In human beings. This disease is characterized by debilitation, liver damage, pnmary 
carcinoma, and often death. The disease is reasonably widespread especially In many African and Asian 
countries, where many people are chronic carriers witii the potential of transmitting the disease 
pandemically. The virus (HBV) consists of a DMA molecule surrounded by a nuclear capsid, in turn 
surrounded by an envelope. Proteins which are associated witii the virus include the surface antigen 
(HBsAg), a core antigen, and a DNA polymerase. The HBsAg is known to produce antibodies in Infected 
people. HBsAg found in tiie serum of infected Individuals consists of protein particles which average ca- 22 
nanometers In diameter, and are tiius called "22 nanometer particles". Accordingly, It Is believed tiiat tiie 
HBsAg particle would be an effective basis for a vaccine. ^ 

C A preferred embodiment of the secondary peptide * „ . . ... x 

45 It has been recognized that environmental conditions are often effective in controlling the quantity ot 
particular enzymes that are produced by cells under certain growtii conditions. In the preferred 
embodiment of the present invention, advantage Is taken of the sensitivity of certain cells to methotrexate 
(ivrrx) which Is an inhlbhor of dihydrofolate reductase (DHFR). DHFR Is an enzyme which is required, 
indirectly. In synthesis reactions Involving tiie transfer of one carbon units. Lack of DHFR activity results in 
so inability of cells to grow except in tfie presence of those compounds which otiierwise require transfer of 
one cartjon units for their synthesis. Cells lacking DHFR, however, will grow in the presence of a 
combination of glycine, thymidine and hypoxanthine. ^ x *u *: 

Cells which normally produce DHFR are known to be inhibited by metiiotrexate. Most of the time, 
addition of appropriate amounts of metiiotrexate to normal cells will result in the death of the cells. 
However, certain cells appear to survive the methotrexate treatment by making increased amounts of 
DHFR, tiius exceeding the capacity of the metiiotrexate to inhibit ttiis enzyme (2, 3, 4). It has been shown 
previously tiiat In such cells, there Is an increased amount of messenger RNA coding for the DHFR 
sequence. This is explained by assuming an Increase in tiie amount of DNA in tiie genetic matenal coding 
for this messenger RNA. In effect apparentiy tiie addition of methotrexate causes gene amplification of the 
DHFR gene. Genetic sequences which are physically connected with tiie DHFR sequence although not 
regulated by the same promoter are also amplified (1, 8, 9, 10). Consequently, it is possible to use ttie 
amplification of tiie DHFR gene resulting from metiiotrexate treatment to amplify concomitantiy the gene 
for anotiier protein, in this case, the desired peptide, , ^ ^ ^ 

Moreover, If tiie host cells Into which the secondary sequence for DHFR Is introduced are themselves 
DHFR deficient DHFR also serves as a convenient maricer for selection of cells successfully transfected. If 
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the DHFR sequence Is effectively connected to the sequence for the desired peptide, this ability serves as a 
marker for successful transfection with the desired sequence as well. 

D. Vector construction techniques employed (materials and methods) 

s The vectors constructed in the Examples set forth in E are constructed by cleavage and ligation of 
isolated plasmids or ONA fragments. 

Cleavage Is performed by treating with restriction enzyme (or enzymes) in suitable buffer. In general, 
about 20 pg plasmid or DNA fragments require about 1 — 5 units of enzyme in 200 pi of buffer solution. 
(Appropriate buffers for particular restriction enzymes are specified by the manufacturer). Incubation times 
10 of about 1 hour at 37X are workable. After incubations, protein is removed by extraction with phenol and 
chloroform, and the nucleic acid recovered from the aqueous fraction by precipitation with ethanol. 

If blunt ends are required, the preparation is treated for 15 minutes at 15° with 10 units of Polymerase I 
(iCIenow), phenol-chloroform extracted, and ethanol predpitated. 

Size separation of the cleaved fragments is performed using 6 percent polyacrytamide gel described by 
IB Goeddel, D. et a/^ Nucleic Acids Res 8; 4057 (1980) incorporated herein by reference. 

For ligating approximately equimolar amounts of the desired components, suitably end tailored to 
provide correct matching are treated with about 10 units T4 DNA ligase per 0.5 pg DMA. 

E. Detailed description of a preferred embodiment: 

20 In general, the expression vector suitable for the present invention is constructed by adaptation of 
gene splidng techniques. The starting material is a naturally occuring bacterial plasmid, previously 
modified, if desired. A preferred embodiment of the present invention utilizes a pML plasmid which is a 
modified pBR 322 plasmid prepared according to Lusky, M. etai^ Nature 239:79 (1981) which is provided 
with a single promoter, derived from the simian virus SV-40 and the coding sequence for DHFR and for 

25 HBsAg. 

In the construction, the promoter (as well as a ribosome binding sequence) is placed upstream from 
the coding sequence coding for a desired protein and one coding for a secondary protein. A single 
transcription termination sequence is downstream from both. At the end of the upstream code sequence is 
placed a translational stop signal; a translational start signal begins the downstream sequence. Thus, 
30 expression of the two coding sequences results in a single mRNA strand, but two separate mature proteins. 

In a particularly preferred embodiment, the sequence coding for the secondary peptide is downstream 
from that coding for the desired peptide. Under these circumstances, procedures designed to select for the 
cells transformed by the secondary peptide will also select for particulariy enhanced production of the 
desired peptide. 

35 

F. Examples 

The following examples are intended to illustrate, but not limit the invention. 
Example 1 

40 Vector containing the HBsAg sequence, pE342.HS94.HBV 

Rgure 1 shows the construction of the HBsAg plasmid. 

The 1986 bp EcoRI-Bglll fragment which spans the surface antigen gene was isolated from the HBV 
viral genome doned with pBR322 as described by Uu et al., DNA /:213 (1982), incorporated herein by 
reference. This sequence was ligated between the EcoRI and BamHl sites of pML, a pBR322 derivative 

45 which lades sequences inhibitory to its replication in simian cells, as described by Lusky et ai^ Nature 
293:73 (1981), Incorporated herein by reference. Into the single EcoRI site of the resulting plasmid was 
inserted the 342 bp origin fragment of SV40 obtained by Hlndlll Pvull digestion of the virus genome, which 
had been modified to be bounded by EcoRI restriction sites resulting in p342E (also referred to as 
pHBs348-E) as described by Levinson etaf,, patent application Serial No. 326,980, filed December 3, 1981, 

SO which is hereby incorporated by reference (EPO Publication No. 0073656). (Briefly, the origin of the Simian 
virus SV40 was Isolated by digesting SV40 DNA with Hindlll, and converting the HindlU ends to EcoRI ends 
by the addition of a converter (AGCTGAATTC). This DNA was cut with Pvull, and Rl linkers added. 
Following digestion with EcoRI, the 348 base-patr fragment spanning the origin was isolated by 
polyacrylamide gel electrophoresis and electroelution, and doned in pBR322. Expression plasmid 

55 pHBs348-E was constructed by cloning the 1986 base-pair fragment resulting from EcoRI and Bglll 
digestion of HBV lAnimaf Vims Genetics, (CH. 5) Acad. P.ress, N. Y. (1980) (which spans the gene encoding 
HBsAg) Into the plasmid pML (Lusky etai^ Nature 293:7B, 1981) at the EcoRI and BamHl sites. (pML Is a 
derivative of pBR322 which has deletion eliminating sequences which are inhibitory to plasmid replication 
in monkey cells). The resulting plasmid (pRI-Bgl) was then linearized with EcoRI, and the 348 base-pair 

gQ fragment representing the SV40 origin region was introduced into the EcoRI site of pRI-Bgl. The origin 
fragment can Insert in either orientation. Since this fragment encodes both the eariy and late SV40 
promoters In addition to the origin of replication, HBV genes could be expressed under the control of either 
promoter depending on this orientation (pHBS348-E representing HBs expressed under control of the early 
promoter), pE342 is modified by partially digesting with EcoRI, filling In the cleaved site using Klenow DNA 

55 polymerase 1, and ligating the plasmid back together, thus removing the EcoRI site preceding the SV40 
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origin in pE342, The resulting plasmid, designated pE342AR1, is digested with EcoRI, filled in using Klenow 
ONA polymerase I, and subcut with BamHI. After electrophoresing on acrylamide gel, the approximately 
3500 bp fragment Is electroeluted, phenol-chloroform extracted, and ethanol precipitated as above). The 5' 
nontranslated leader region of HBsAg was removed by treatment with EcoRI and with Xba, and the 

5 analogous 150 bp EcoRI-Xba fragment of a hepatitis expression plasmid pHS94 (Liu et al. (supra)) was 
Inserted in Its place to create pE342,HS94.HBV. . i,„ . 

(As described by Liu, et aL pHS94 contains the translational start codon of the authentic HBsAg gene, 
but lacks all 5' nontranslated message sequences. The levels of expression of both the authentic EcoRI-Bglll 
and pHS94 derived equivalent under control of the SV40 early promoter as described above are equivalent 

10 and are Interchangeable without affecting the performance of the plasmid). 



Example 2 

Vector containing the DHFR sequence, pE342.D22 ^ ^. * u-^u 

A plasmid carrying DHFR as the only expressable sequence Is pE348.D22, the construction of which is 

IS J'jgJfl^JJ , i^ggrt ^j^g DHFR cDNA plasmid DHFR-1 1 (Nunberg et al.. Cell 73:355, 1980) was 

treated with the exonuclease Bal31 in order to remove the poly G:C region adjacent to the Pst I s,tes, 
digested with Bglll and the resulting fragments of approximately 660 bp Isolated from gels. The Bal31-Bglll 
digested cDIMA was ligated into a pBR322 plasmid derivative containing a Bglll site. (Following digestion of 

20 PBR322 with Hind III, the plasmid fragment was filled In using Klenow DMA polymerase in the Presence of 
the four deoxynucleotide triphosphates, and subcut with Bglll). The resulting plasmid, PDHFW)22, hasan 
EcoRI site situated 29 bp upstream of the fusion site between pBR322 and the B' end of the DHFR cDNA. The 
EcoR l-Bfllil fragment encompassing the coding sequences of the cDNA Insert was therj excised from 
PDHFR-D22 and ligated to EcoRI-BamHI digested pE342.HBV (Example 1), creating the DHFR expression 

2S plasmid pE342.D22. 

Example 3 

Vectors containing both DHFR and HBsAg sequences u • *u nuco ««« 

Two such vectors were constructed, pE342.HBV.D22 contaming a polycistron wherein the DHFR gene 
30 is downstream from the HBsAg gene, and pE342.HBV.E400.D22, (Rgure 3) in which the genes coding for 
DHFR and HBsAg are not polycistronlc. 

A. pE342.HBV.D22 was constructed by ligating the EcoRI-TaqI fragment of cloned HBV DNA (Uu et aL 
(supra)), to EcoRI-Clal digested pE342.D22. ^ „ .„ 

B. This plasmid was further modified by fusing an additional SV40 early promoter between the Bglll 
55 site and the Clal site of the DHFR Insert of pE342.HBV.D22, creating pE342.HBV£400.D22. 

HBV viral DNA contains a single TaqI site 20 bp beyond the Bglll site that was used to generate the 
EcoRI-Bglil fragment encompassing the surface antiflen gene. Thus, EcoRI and TaqI digestion of doned 
HBV viral DNA results in a fragment of -2000 bp spanning the surface antigen gene, and containing a 
single Bglll site (1985 bp from the EcoRI site (Uu etaL (supra)). (The ends of DNA fragments TaqI and Clal 
40 generated by digestion are cohesive, and will ligate together). . , , ^ 

The Clal site is regenerated; thus pE342.HBV.D22 contains both a Bglll and Clal site, which are situated 
Immediately in front of the DHFR coding sequences. ^ ^. . . . ^bv/ f^oo 

An SV40 origin bounded by restriction sites cohesive with the Bglll and Clal sites of pE342.HBV.D22 
was constructed by digesting SV40 DNA with Hpall, filling In as described above, and subcutting v^h 
4S Hindlll. A 440 bp fragment spanning the origin was Isolated. This was ligated, in a tripartite ligation, to the 
4000 bp pBR322 fragment generated by Hindlll and BamHI digestion, and the 1986 bp fragment spanning 
the surface antigen gene generated by digesting the cloned HBV viral DNA with EcoRI, filling In with 
Klenow DNA polymerase 1, subdigesting with Bglll, and Isolating on an acrylamide gel. Ugatlon of all three 
fragments Is achievable only by joining of the filled in Hpall with EcoRI, the two Hindlll sites with each other 
and the Bglll with BamHI, Thus when the resulting plasmid is restricted with Clal and BamHI, a 470 bp 
fragment Is obtained which contains the SV40 origin. This fragment Is Inserted into the Clal and Bglll sites 
of pE342.HBV.D22, (paragraph A) creating pE342.HBV.E400,D22 (Figure 3). 

Example 4 

« Transfeclion of host cells ^ ^ , . , *u 

The host cells herein are vertebrate cells grown In tissue culture. These cells, as is known In the art, can 
be maintained as permanent cell lines prepared by successive serial transfers from isolatwl normal cells. 
These cell lines are maintained either on a solid support in liquid medium, or by growth in suspensions 
containing support nutrients. ^ , 

In the preferred embodiment, CHO cells, which were deficient in DHFR a^^vj^V "^ed. TTiese c^ 
prepared and propagated as described by Uriaub and Chasin, Proc NatL Acad. Scl (USA) 77:4216 (1980), 
which is incorporated herein by reference. ^i. ^ x ^ «u«_ 

The cells are transfected with 5 mg of desired vector as prepared above usmg the method of Graham 
and Van Der Eb, Virology 52:456 (1978) Incorporated herein by reference. „ 
^ The method Insures the interaction of a collection of plasmids with a particular host cell, thereby 



60 



7 



EP 0117 058 B1 



increasing the probability that if one plasmid Is absorbed by a cell, additional plasmids would be absorbed 
as well. Accordingly, it is practicable to introduce both the primary and secondary coding sequences using 
separate vectors for each, as well as by using a single vector containing both sequences. 

5 Example 5 

Growth of transfected cells and expression of peptides 

The CHO cells which were subjected to transfection as set forth above were first grown for two days in 
non-selective medium, then the cells were transferred into medium lacking glycine, hypoxanthine, and 
thymidine, thus selecting for cells which are able to express the plasmid DHFR. After about 1 — 2 weeks, 

ro individual colonies were isolated with cloning rings. 

Cells were plated in 60 or 100 mm tissue culture dishes at approximately .5x10^ cells/dish. After 2 days 
growth, growth medium was changed. HBsAg was assayed 24 hours later by RIA (Ausria II, Abbott). Cells 
were counted and HBsAg production standardized on a per cell basis. 10—20 random colonies were 
analyzed In this fashion for each vector employed. 

rs In one example of the practice of the invention, the following results were obtained: 

Transfectional 

efficiency of HBsAg production; ng/10® cells/day 

Dhfr" cells (percent of colonies in given range) 

20 (colonies/ug/ 



Vector 


10^ cells) 


0 


O-10 


10—100 


100-500 


500-1500 


>1500 


pE342.D22 


335 


100 


0 


0 


0 


0 


0 


PE342.HS94 


<.2 


1 


1 


1 


1 


1 


1 


pE342.D22+pE34ZHS94 


340 


0 


50 


30 


20 


0 


0 


pE342.HBV.D22 


20 


0 


0 


0 


0 


55 


45 


pE342.HBV.E400.D22 


510 


0 


17 


17 


58 


8 


0 



The production of surface antigen in several of the highest expressing cell lines has been monitored for 
greater than 20 passages and Is stable. The cells expressing the surface antigen remain attached to the 
35 substratum indefinitely and will continue to secrete the large amounts of surface antigen as long as the 
medium is replenished. 

It is clear that ^e polycfstronic gene con^ction results in isolation of the cells producing the highest 
levels of HBsAg. 100 percent of colonies transfonmed with pE342.HBV.D22 produced over 500 ng/10^ 
cells/day whereas 92 percent of those transformed with the non-polyclstronic plasmid 
40 pE342>IBV.E400.D22 produced less than that amount Onfy cells from the polycisuonic transfection 
demonstrated production levels of more than 1500 ng/10° cells/day. 

Example 6 

Treatment with methotrexate 

45 The surface antigen expressing cell lines are inhibited by methotrexate (MTX), a specific inhibitor of 
DHFR at concentrations greater than 10 nM. Consistent with previous studies on the effects of MTX on 
tissue culture cells, occasional clones arise which are resistant to higher concentrations (50 nM) of MTX at a 
frequency of approximately 10*^. However, these clones no longer produce surface antigen despite the 
amplification of HBV sequences in the MTX resistant clones. Thus, the HBV gene is amplified, though 

so expression falls off in this case. This suggests that further production of surface antigen may be lethal to 
the cell. 

Example 7 

Recovery of desired peptide 

55 The surface antigen produced is in the form of a particle, analogous to the 22 nm particle observed in 
the serum of patients infected with the virus. This form of antigen has been shown to be highly 
Immunogenic. When the cells are grown In medium lacking calf serum or other supplements, 
approximately 10 percent of the protein contained in the medium is surface antigen and this protein can be 
Isolated by methods known in the art The surface antigen com ig rates on SDS-polyacrylamide gels with the 

so 22 nm particle derived protein. 

References 

1, Wigler, M. et aL, Proc Nati. Acad. Set, 77:3B67 (1980) 

2. Schimke, Robert T. et aL, Sdence 202'A0S^ (1978) 
ss 3. Biedler, J. L et aL, Cancer Res. 32:153 (1972) 



8 



EP 0117 058 B1 

4. Chang, S. E. "et aL Cell 7:391 (1976) 

5. Fischer, G. A., Biochem Pharmacol, 1VA222 (1962) 

6. Sirotnak, F. M, et al.. Cancer Res. 2fl:75 (1968) 

7. Rintoff, W. F. et aL, Somat CelL Genet 2:245 (1976) 

8. Cristman, J, et ah, Proc NatL Acad. Sa\ 7S:1815 (1982) 

9 Ringold, Gordon et aL, J, Molec and AppL Gen. /:165 (1981) 

10, Kaufman R, F. et al., J. Molec Biol. 759:60^ (1982) 

11. Perucho, Manuel et al., Ce// 22:309 (1980) 



70 Claims 



1 A method of selecting transfected vertebrate host cells for expression of a desired polypeptide, 
which method comprises transfecting the vertebrate cells with an expression vector comprising a promoter 
operable in a vertebrate host cell and first and second polypeptide coding sequences under the control of 
IS said promoter, said coding sequences being separated by a translational stop signal and a translational 
start signal without any intervening splice site which is effective In the host cell and selecting transfectants 
which exhibit expression of the second polypeptide accompanied by a higher level of expression of said 

^'"^^Z A^meSod ac^ to Claim 1, wherein the second polypeptide is capable of marking transfection 
20 by the expression vector and/or regulating the expression of the first Poiypeptlde. 

3 A method according to Claim 1 or Claim 2 wherein the promoter is the SV40 eariy promoter. 

4. A method according to any one of the preceding claims wherein the transfectants are grown under 
selective culture conditions favouring expression of the second polypeptide, 

5. A method according to any one of the preceding claims wherein the second polypeptide is DHFR. 
25 6 A method according to Claim 5 wherein the host cells are deficient in DHFR. 

7! A method according to Claim 6 or Claim 6 wherein the transfected host cell is grown in the presence 

of a DHFR inhibitor. 

8. A method according to Claim 7 wherein the DHFR inhibitor is methotrexate* 

9. A method according to any one of the preceding claims wherein the host cells are CHO cells, 

10. A method for producing a desired polypeptide which comprises culturing transfected vertebrate 
host cells obtained according to any one of the preceding claims so as to express said first polypeptide 
coding sequence, said first polypeptide being the desired polypeptide. 

Patentanspruche 

1 Verfahren zur Selektion von transflzlerten Wlrbeltier-Wirtszelien fur die Expression eines 
gewQnschten Polypeptides, welches Verfahren das Transfizieren der Wirbeltieraellen mit einem 
Expressionsvektor, dor einen Promoter, der in einer Wirbeltierwirtszelle operabel ist, und erste und zweite 
Polypeptldkodierungssequenzen unter der Steuerung des genannten Promoters enthSlt wobei die 
aenannten Kodierungssequenzen durch ein Translationsstopsignal und ein Translationsstartsignal ohne 
Intervenierende Spleilistelle getrennt sind, und der In der Wirtszelle wirict, und die Selektion von 
Transfektanten, die die Expression des zweiten Polypeptides, begleitet von einem hqheren 
Expressionswert des genannten ersten Polypeptides zeigen, umfaSt 

2. Verfehren nach Anspruch 1, worin das rweite Polypeptid fShig ist, die Transfektion durch den 
Expressionsvektor zu marideren und/oder die Expression des ersten Polypeptides zu steuem. 
3 Verfahren nach Anspruch 1 oder 2, worin der Promoter der friihe SV40 Promoter 1st 

4. Verfahren nach einem der vortiergehenden AnsprOche, worin die Transfektanten unter selektiven 
Kuiturtjedingungen kultivlert werden, die die Expression des zweiten Polypeptides fdrdern. 

5. Verfahren nach einem der vortiergehenden AnsprOche, worin das zweite Polypeptid DHFR ist 

6. Verfahren nach Anspruch 5, worin die Wirtzsellen arm an DHFR sind, 

7. Verfahren nach Anspruch 5 oder 6, worin die tranefizierte Wirtszelle in Gegenwart eines 
DHFR-lnhibitors kultiviert wird, 

8. Verfahren nach Anspruch 7, worin der DHFR Inhibitor (Vlethotrexat 1st „ 
9 Verfahren nach einem der vortiergehenden AnsprOche, worin die Wirtzellen CHO-Zellen sind, 
10. Verfahren zur Herstellung eines gewQnschten Polypeptides, welches das Kultivieren yon 

transfizierten Wirbeleiter-Wirtszellen umfaBt, die nach einem der vorhergehenden Anspruche ertialten 
warden, um die genannte erste Polypeptld-Kodierungssequenz zu exprimieren, wobei das genannte erste 
Polypeptid das gewunschte Polypeptid ist. 
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SO Revendications 

1 Proc6d6 de selection de cellules hdtes transfect6es de vert6br6s pour I'expression d'un polypeptide 
souhalt^, iequel proc6d6 comprend la transfection des cellules de vert6br6s avec un vecteur d'expression 
comprenant un promoteur utilisabio dans une cellule hote de vert6br6 et des premiere et seconde 
6s sequences de codage de polypeptide sous le contr6le dudit promoteur, lesdites s6quences de codage etant 
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s§par^es par un signal d'arret de traduction et un signal de dSbut de traduction sans aucin site 
intermSdtaIre d'^pissage qui est efficace dans la cellule hdte et la selection de transfectants qui pr^sentent 
I'expression du second polypeptide accompagnee d'un niveau sup6rieur d'expression dudit premier 
polypeptide. 

^ 2. Proc6de selon la revendication 1 oO le second polypeptide est capable de marquer la transfection par 
le vecteur d'expression et/ou de regular I'expression du premier polypeptide. 

3. Proc4de selon la revendication 1 ou la revendication 2 oCi le promoteur est le promoteur precoce de 
SV40. 

4. Proc6d§ selon Tune quelconque des revendications pr6c6dentes. oCi . les transfectants sont 
'0 d6velopp6s en conditions de culture selective favorisant I'expression du second polypeptide. 

5. Proc6d6 selon I'une quelconque des revendications pr6c^dentes oO le second polypeptide est DHFR. 

6. Proced^ selon la revendication 5 oii les cellules hdtes sont d^ficientes en DHFR. 

7. Procdd^ selon la revendication 5 ou la revendication 6 oCi la cellule hdte transfect^e est ddvelopp^e 
en presence d'un inhibiteur de DHFR. 

IS 8. Proc^dS selon la revendication 7 oCi I'inhibiteur de DHFR est le methotrexate. 

9. Proc^dd selon I'une quelconque des revendications pr6c6dente3 oO les cellules hdtes sont des 
cellules de CHO. 

10. Proc^dS de production d'un polypeptide souhaite, qui comprend la mise en culture de cellules 
hdtes trarisfect^es de vert^bres obtenues selon I'une quelconque des revendications precedentes afin 

20 d'expn'mer la sequence codant ledtt premier polypeptide, ledit premier polypeptide 6tant le polypeptide 
souhattS. 
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